High-frame-rate full-vocal-tract 3D dynamic speech imaging.

نویسندگان

  • Maojing Fu
  • Marissa S Barlaz
  • Joseph L Holtrop
  • Jamie L Perry
  • David P Kuehn
  • Ryan K Shosted
  • Zhi-Pei Liang
  • Bradley P Sutton
چکیده

PURPOSE To achieve high temporal frame rate, high spatial resolution and full-vocal-tract coverage for three-dimensional dynamic speech MRI by using low-rank modeling and sparse sampling. METHODS Three-dimensional dynamic speech MRI is enabled by integrating a novel data acquisition strategy and an image reconstruction method with the partial separability model: (a) a self-navigated sparse sampling strategy that accelerates data acquisition by collecting high-nominal-frame-rate cone navigator sand imaging data within a single repetition time, and (b) are construction method that recovers high-quality speech dynamics from sparse (k,t)-space data by enforcing joint low-rank and spatiotemporal total variation constraints. RESULTS The proposed method has been evaluated through in vivo experiments. A nominal temporal frame rate of 166 frames per second (defined based on a repetition time of 5.99 ms) was achieved for an imaging volume covering the entire vocal tract with a spatial resolution of 2.2 × 2.2 × 5.0 mm3 . Practical utility of the proposed method was demonstrated via both validation experiments and a phonetics investigation. CONCLUSION Three-dimensional dynamic speech imaging is possible with full-vocal-tract coverage, high spatial resolution and high nominal frame rate to provide dynamic speech data useful for phonetic studies. Magn Reson Med 77:1619-1629, 2017. © 2016 International Society for Magnetic Resonance in Medicine.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Database of Volumetric and Real-Time Vocal Tract MRI for Speech Science

We present the USC Speech and Vocal Tract Morphology MRI Database, a 17-speaker magnetic resonance imaging database for speech research. The database consists of real-time magnetic resonance images (rtMRI) of dynamic vocal tract shaping, denoised audio recorded simultaneously with rtMRI, and 3D volumetric MRI of vocal tract shapes during sustained speech sounds. We acquired 2D real-time MRI of ...

متن کامل

State-of-the-Art MRI Protocol for Comprehensive Assessment of Vocal Tract Structure and Function

Magnetic Resonance Imaging (MRI) provides a safe and flexible means to study the vocal tract, and is increasingly used in speech production research. This work details a state-ofthe-art MRI protocol for comprehensive assessment of vocal tract structure and function, and presents results from representative speakers. The system incorporates (a) custom upper airway coils that are maximally sensit...

متن کامل

Three-dimensional Modeling of Tongue during Speech Using Mri Data

The tongue is the most important and dynamic articulator for speech formation, because of its anatomic aspects (particularly, the large volume of this muscular organ comparatively to the surrounding organs of the vocal tract) and also due to the wide range of movements and flexibility that are involved. In speech communication research, a variety of techniques have been used for measuring the t...

متن کامل

Segmentation and 3D reconstruction of the vocal tract from MR images – a comparative study

Speech production is an important human function involving a set of organs with specific morphological and dynamic aspects. The inter-speaker variability, the coarticulation or the nasality are some interesting aspects to improve a realistic 3D modeling of the vocal tract. For this, the understanding of the mechanism of speech production is crucial, as the current image data is not sufficient t...

متن کامل

The KTH 3D Vocal Tract project (Engwall, 1999) aims at realistic modeling of the intraoral articulator movement in speech, using a rule-based approach to visual speech synthesis

movement in speech, using a rule-based approach to visual speech synthesis (Beskow, 1995). The hope is that a realistic 3D model of the tongue, made visible in the frame of a synthetic face (Lundeberg and Beskow, 1999), as shown in Fig. 1, can be of use in pronunciation training to provide visual feedback to eg. hearing-impaired children. In the current state of the project, the model consists ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Magnetic resonance in medicine

دوره 77 4  شماره 

صفحات  -

تاریخ انتشار 2017